CDS
Accession Number | TCMCG075C00197 |
gbkey | CDS |
Protein Id | XP_017980736.1 |
Location | complement(join(696575..696790,696925..698067,698404..698544,698682..698724,698821..698890,698977..699171,699978..700067,700195..700321,700845..701004,701169..701416,701662..702459,702600..702842,702957..703247)) |
Gene | LOC18610744 |
GeneID | 18610744 |
Organism | Theobroma cacao |
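
The Location value above uses the GenBank feature-location syntax: `join(...)` lists the exon segments and `complement(...)` places the feature on the reverse strand. As a minimal sketch (the helper names `parse_location` and `cds_length` are illustrative and not part of any library), the segments can be extracted with a regular expression and summed to check the coding length against the protein length reported in this record:

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "complement(join(696575..696790,696925..698067,698404..698544,"
    "698682..698724,698821..698890,698977..699171,699978..700067,"
    "700195..700321,700845..701004,701169..701416,701662..702459,"
    "702600..702842,702957..703247))"
)

def parse_location(location):
    """Return (is_complement, [(start, end), ...]) for a simple join() location.

    Hypothetical helper: handles only plain 'start..end' segments,
    not fuzzy positions such as '<1..200' or remote references.
    """
    is_complement = location.strip().startswith("complement(")
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments

def cds_length(segments):
    """Total nucleotides covered; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)

is_complement, segments = parse_location(LOCATION)
total_nt = cds_length(segments)
codons, remainder = divmod(total_nt, 3)

print("reverse strand:", is_complement)
print("segments:", len(segments))
print("coding nucleotides:", total_nt)
print("amino acids (excluding stop codon):", codons - 1)
```

For this record the 13 segments sum to 3765 nt, that is 1255 codons, which agrees with the 1254 aa protein length plus one stop codon.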
Protein
Length | 1254aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018125247.1 |
Definition | PREDICTED: histidine kinase 1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGTGCATACACTGCTTCTTACATGGTTTCAAAAAAATGGCAGAAAAAGCATTTGAAACCCCTCCTAGAAGTAGTTCTGAAAGTTCACCAATTCTAGCCTCTCCAATGGCTACTCCTTTGAGAAAGGTGTTCAATAGAATTTCGGGTTTTGCTTCATCTTGGGGAAGGAAAACGGCTCCCCGGGGTGGTAGGATTTTCCATAGGGATGTGGAACAGGAAGAATTCCAGTACGCAAGTACTCAGTGTCTCTCATCATACTACAGTGTTTTTGTAGCTCGCCTTGCCATCATGGTCATGCTAGCTATTTTGATTGGGCTGCTAACCATTCTAACGTGGCATTTCACAAGGATCTACACAACAAGATCACTAAACACCTTAGCTTTCGGTCTTCGTTATGAGCTTCTTCAGCGGCCTATCTTACGGATGTGGAACATCTTGAATTCTACCTCAGAAATAACAACTACCCAGGTCAAGTTGTCAGAATATGTAATCAAGCGGTACAGCAAGCCTACCACTCAGGCAGAACAAGTTGAGCTGTATCAAATGATGAAAGATGTAACATGGGCACTATTTGCCAGTCGGAAGGCTCTCAATGCCATAACCATAAATTACAAAAATGGATTTGTCCAGGCATTCCACAGAGACCACAGGAGTAACAATACATTTTACATCTACTCTGATCTTGTAAATTATTCAATCAGTACCAGTGAGTCTTATGACACCAAAATGCTGACATCACGACAAGGATGGAACGACCAATTCTTTCATGGTAATTTTTCTGCGATCTGGTACCGTGAACCACTTGATCCTGTCACTGGAGAGAAGACAGGAAAGGCAAAGCCAATTCCACCCGATGATCTTATCAATATAGCAGGCCCTTCACAAGTGCCTGATGGGGTAGCTTCATGGCATGTGGCAGTGAGCAAGTACACAGATTCACCGTTGCTTTCAGCAGCTCTTCCTGTACGGGACGCTTCGAACACAAGTATAGTAGCTGTTGTAGGTGTCACCACAGCACTTTATAGTGTAGGCCAGCTGATGAAAGAACTGGTTGAAGTGCACAGCGGGTACATATATTTGACCTCACAAGAGGGTTACTTACTGGCTACATCCACAAATGCTCCTCTGTTAAAAAATACAACAAAGGGCCCTAAGCTTATGATGGCTGTTGATTCAGAGGATCATGTGATACGAATGGGGGCCCAGTGGCTACAGAATGCCTATGGAAACAAGTTCCCTCCTGGTCATGTGGTTCATGTGGAGAATGCCAATCTTGGCGGCAAGCATTATTACATTGATTCATTTTTTCTGAACCTAAAAAGACTACCAATGGTTGGAGTTATCATCATTCCAAGAAAATATATAATGGGGAAGGTTGATGAAAGAGCCTTGAAAACCTTAATCATATTGATATCTGCATCTGTATGTATCCTAGTCATTGGATGTGTCTGTATTTTGATACTGACAAATGGAGTATCAAAGGAAATGAAACTCAGAGCAGAGTTGATAAGTCATCTTGATGCAAGAAGAAGAGCAGAAGCATCAAGCAATTATAAAAGCCAATTCCTTGCAAATATGAGTCATGAACTAAGGACGCCTATGGCTGCTGTTATTGGATTGTTGGACATTCTTATCTGTGATGACTGTCTCACAAATGAACAGTATGCAATGGTTACCCAGATCCGTAAATGCTCAACTGCTCTACTCCGGCTTCTCAACAACATATTGGACTTGAGCAAGGTTGAATCTGGAAAACTGGTGTTAGAAGAAACTGAGTTTGACTTGGGACGGGAACTTGAAGGACTCGTTGATATGTTCTCTGTGCAGTGCATTAACCACAATGTGGAGACTGTTCTGGATCTCTCTGATGACATTCCCAAATTAGTTAGAGGAGACTCTGCCAGAGTTGTTCAAGTTTTTGCAAACCTAATAAGCAATTCCATCAAGTTCACAACATCTGGTCATATCATCCTGCGCGGATGGTGTGAGAATCCC
AATGTGTCTAGTGATTCTGGGAAGTTCTCTCCCGATCGGAAGAAATCACTGTCTGCACTAAGGACAAAGTTGAAGCAACATGGAAACCATATGAAGAAGGCCAGCAAGAGAGACAACAAAATGATTCTTTGGTTTGAAGTTGATGACACAGGCTGCGGAATTGATCCAAGCAAATGGGAATCTGTGTTTGAGAGCTTTGAGCAAGCTGATCCTTCAACAACTCGGACGCACGGTGGCACAGGTCTTGGACTCTGCATTGTGAAAACCTTGGTTCACAAGATGGGTGGAGAAATCAAGGTTGTGAAAAAGAATGGTCCTGGTACTCTAATGAGACTATTCTTGCTTCTCAGTACTCCTGCAGATGGCACAGAACAACATGGTCAAGTGGATTTTGCAAAGCACAGTGTAGCAGTGATCCTTGCGCTAAACGGCAGCATGGGTAGATTGATTATGTCCCAGTGGTTGTCTAGAAATGGAGTACCCACTTTGGAAGCATCTGAGTGGAATGAACTGACACAAATCCTTCATGAACTGTTTCATGCCAGGACTCGTAATTGTGGTTTTGATTCTCATTATTCACTAAATGAAACACTGAGATCTAAAGTACACTGCATACAAGACATGAGGAGCCCAGCTTACGTTATAGTTGTTGATCTGGGGCTGCTTGACTTGAGCACAGATATATGGAAAGAACAGCTTAATTTTCTTGACAAATTCTCTGGCCAAGTGAAATTTGCATGGATGCTGAATCATGATACTTCCAATGCTATAAAGATGGAGCTCCGCAGGAAAGGACATATATTGATGGTTAACAAGCCACTGTACAAGGCAAAAATGCTTCATATTTTGGAAGCTGTCATAAAGGAGAGATACGTTGAACTTCAAAAGAGAAGGACAAATGGAACAAAAGGTACAGCGAAAGAAGGTGATTCTCATGAGTGTCTCGAGATCGATTCATCTCATTTTGAGACTTGCAGCTCTGATGATTCTGACAATTCTGAATCGGGTGGCACTAATTCTGTAAGTTCTGTGCATACTGGAGAGGAAATAAGAGAAGGAACCGTGAAATCCAGTCCATCAAATTGCCAGACACTTAAGAACTGCCTAGTTGAATTCACGCATTTAGGTTCAGAAGTAAACGGTCTTAGGGCAGAAGAAGACCAGTGCAATGCCAGGCCTAAGTTACATGATACTGAAGATACCAAATATGAAAGTTCCAATTCCCCAGAACAACATTCGGTCAGCAGCAGTGCTAAAGATAGAGACGATTCATATACAAGTAAGGCAGCAAATGGACAGAAATCTCTTGAAGGCCTGCGGATACTGCTTGCAGAAGACACACCAGTTCTCCAAAGGGTAGCAACCATCATGCTGGAAAAAATGGGAGCTACAGTAATTGCTGTTGGGGATGGACTGCAGGCAGTAGACGCCCTGAACTGCGTGCTCAATGGAGAAGAGTATAGAAGGGACTCCTCATTGCAAGAAAGGAGGAACAGACTACAGACAGAAATTAGTGATTCTCCTCCATATGATTTGATCTTAATGGATTGCCAAATGCCAAAGATGGATGGATATGAAGCAACAAAAGCAATCAGGAAATCAGAAGCAGGAACCGGCTGGCACATACCTATTGTTGCCTTGACAGCCCATGCGATGTCATCAGATGAAGCAAAATGCTTGGAGGTGGGCATGGATGCTTATCTAACAAAGCCAATTGACTACAAGTTGATGGTGTCCACCATTCTTTCACTCACCAAAAGATCAGCCTAA |
Protein: MCIHCFLHGFKKMAEKAFETPPRSSSESSPILASPMATPLRKVFNRISGFASSWGRKTAPRGGRIFHRDVEQEEFQYASTQCLSSYYSVFVARLAIMVMLAILIGLLTILTWHFTRIYTTRSLNTLAFGLRYELLQRPILRMWNILNSTSEITTTQVKLSEYVIKRYSKPTTQAEQVELYQMMKDVTWALFASRKALNAITINYKNGFVQAFHRDHRSNNTFYIYSDLVNYSISTSESYDTKMLTSRQGWNDQFFHGNFSAIWYREPLDPVTGEKTGKAKPIPPDDLINIAGPSQVPDGVASWHVAVSKYTDSPLLSAALPVRDASNTSIVAVVGVTTALYSVGQLMKELVEVHSGYIYLTSQEGYLLATSTNAPLLKNTTKGPKLMMAVDSEDHVIRMGAQWLQNAYGNKFPPGHVVHVENANLGGKHYYIDSFFLNLKRLPMVGVIIIPRKYIMGKVDERALKTLIILISASVCILVIGCVCILILTNGVSKEMKLRAELISHLDARRRAEASSNYKSQFLANMSHELRTPMAAVIGLLDILICDDCLTNEQYAMVTQIRKCSTALLRLLNNILDLSKVESGKLVLEETEFDLGRELEGLVDMFSVQCINHNVETVLDLSDDIPKLVRGDSARVVQVFANLISNSIKFTTSGHIILRGWCENPNVSSDSGKFSPDRKKSLSALRTKLKQHGNHMKKASKRDNKMILWFEVDDTGCGIDPSKWESVFESFEQADPSTTRTHGGTGLGLCIVKTLVHKMGGEIKVVKKNGPGTLMRLFLLLSTPADGTEQHGQVDFAKHSVAVILALNGSMGRLIMSQWLSRNGVPTLEASEWNELTQILHELFHARTRNCGFDSHYSLNETLRSKVHCIQDMRSPAYVIVVDLGLLDLSTDIWKEQLNFLDKFSGQVKFAWMLNHDTSNAIKMELRRKGHILMVNKPLYKAKMLHILEAVIKERYVELQKRRTNGTKGTAKEGDSHECLEIDSSHFETCSSDDSDNSESGGTNSVSSVHTGEEIREGTVKSSPSNCQTLKNCLVEFTHLGSEVNGLRAEEDQCNARPKLHDTEDTKYESSNSPEQHSVSSSAKDRDDSYTSKAANGQKSLEGLRILLAEDTPVLQRVATIMLEKMGATVIAVGDGLQAVDALNCVLNGEEYRRDSSLQERRNRLQTEISDSPPYDLILMDCQMPKMDGYEATKAIRKSEAGTGWHIPIVALTAHAMSSDEAKCLEVGMDAYLTKPIDYKLMVSTILSLTKRSA |
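
The Protein entry is the conceptual translation of the CDS entry. The sketch below assumes the standard genetic code and uses a hypothetical helper, `translate`, to illustrate the relationship on the first and last codons of the CDS above. It strips whitespace first, so a CDS string that wraps across lines is handled as well.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence up to (not including) the first stop codon.

    Hypothetical helper: unknown or ambiguous codons are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 15 nucleotides of the CDS entry in this record.
cds_start = "ATGTGCATACACTGCTTCTTACATGGTTTC"
cds_end = "AAAAGATCAGCCTAA"

print(translate(cds_start))  # first ten residues of the protein
print(translate(cds_end))    # last four residues, stop codon excluded
```

Passing the full CDS string to `translate` and comparing the result with the Protein entry is a quick consistency check for records of this kind.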